A Scalable Cross-Language Metasearch Architecture* for Multilingual Information Access on the Web
نویسندگان
چکیده
This position paper for the special session on "Multilingual Information Access" comprises of three parts. The first part reviews possible demands for Multilingual Information Access (hereafter, MLIA) on the Web, and examines required technical elements. Among those, we, in the second part, focus on Cross-Language Information Retrieval (hereafter, CLIR), particularly a scalable architecture which enables CLIR in a number of language combinations. Such a distributed architecture developed around XIRCH project (an international joint experimental project currently involves NTT, KRDL, and KAIST) is then described in a certain detail. The final part discusses some NLP/MT related issues associated with such a CLIR architecture.
منابع مشابه
Web-Based Information Access: Multilingual Automatic Authoring
The needs for managing similar documents in different languages increases with the growing amounts of electronic information available in documents of the same type (e.g. news streams). This paper proposes a viable approach to information access emphasizing the hypertextual paradigm in a multilingual framework. This task of processing/structuring text so that cross-lingual hypertext links are g...
متن کاملModern Multilingual and Cross-lingual Information Access Technologies
In this chapter, we describe the state of the art cross-lingual and multilingual strategies and their related areas. In particular, we show a WWW-based information system called MIETTA, which allows uniform and multilingual access to heterogeneous data sources in the tourism domain. The design of the search engine is based on a new cross-lingual framework. The framework integrates a cross-lingu...
متن کاملArchitectural Design of WebScales - A Large-Scale Metasearch Engine
It is estimated that there are hundreds of thousands of information sources on the Web, including both the Surface Web and the Deep Web. Most of these sources have their own search capabilities. In order to alleviate ordinary users from the formidable task of identifying useful sources and search them individually, it is important to provide a unified access to these sources. Metasearch engine ...
متن کاملChallenges for the multilingual Web of Data
The Web has witnessed an enormous growth in the amount of semantic information published in recent years. This growth has been stimulated to a large extent by the emergence of Linked Data. Although this brings us a big step closer to the vision of a Semantic Web, it also raises new issues such as the need for dealing with information expressed in different natural languages. Indeed, although th...
متن کاملExploring the Effects of Language Skills on Multilingual Web Search
Multilingual access is an important area of research, especially given the growth in multilingual users of online resources. A large body of research exists for Cross-Language Information Retrieval (CLIR); however, little of this work has considered the language skills of the end user, a critical factor in providing effective multilingual search functionality. In this paper we describe an exper...
متن کامل